SemanticScuttle - klotz.me » klotz: bloom filter

NEXUS DSA — Distributed Search Engine

NEXUS is a production-grade, full-text and semantic search engine built from scratch, implementing advanced data structures and distributed systems concepts. It focuses on probabilistic optimization, sub-millisecond latency, and hybrid AI-powered search. The project demonstrates core technologies like LSM Trees, Bloom Filters, HNSW Graphs, and W-TinyLFU caches, integrated into a high-performance pipeline. It also includes a LeetCode algorithm library with implementations of classic interview patterns and provides insights into distributed crawling and persistent storage.

2026-03-13 Tags: search engine, data structures, distributed systems, lsm tree, bloom filter, hnsw, w-tinylfu, pagerank, algorithm, crawler, performance by klotz

Generic Load/Save Functions - Spark 3.2.0 Documentation

usersDF.write.format("orc")
  .option("orc.bloom.filter.columns", "favorite_color")
  .option("orc.dictionary.key.threshold", "1.0")
  .option("orc.column.encoding.direct", "name")
  .save("users_with_options.orc")

Find full example code at "examples/src/main/scala/org/apache/spark/examples/sql/SQLDataSourceExample.scala" in the Spark repo.

2021-12-01 Tags: spark, orc, bloom filter, parquet, hadoop by klotz

RedisBloom - Probabilistic Datatypes Module for Redis

2021-03-16 Tags: redis, bloom filter, data, probabilistic by klotz

Approximately Detecting Duplicates for Streaming Data using Stable Bloom Filters

2021-02-11 Tags: bloom filter, duplicates, streaming, approximate algorithms, machine learning, production engineering by klotz

Towards a scalable Bloom filter | Object survivor space

https://www.eecs.harvard.edu/~michaelm/postscripts/rsa2008.pdf

2020-03-16 Tags: bloom filter, hash, java by klotz

Less Hashing, Same Performance: Building a Better Bloom Filter

Adam Kirsch, Michael Mitzenmacher

2020-03-16 Tags: bloom filter, hash by klotz

A neural data structure for novelty detection

2020-03-12 Tags: bloom filter, neuroscience, drosophila, ontology by klotz

Bloom Filter-Assisted Joins with PySpark - Tech at Magnetic

2018-04-16 Tags: bloom filter, spark, python, hadoop by klotz

GitHub - willf/bloom: Go package implementing Bloom filters

2018-02-22 Tags: golang, bloom filter, github by klotz

bloom filter - Spark and BloomFilter sharing - Stack Overflow

Example Bloom Filter use in Spark 2.0

2017-04-04 Tags: spark, bloom filter, scala by klotz
